CDS

Accession Number TCMCG075C23690
gbkey CDS
Protein Id XP_007018417.1
Location complement(join(4061360..4061464,4061602..4061880,4061988..4062263,4062417..4062635,4062776..4062944,4063153..4063327,4063454..4063565,4064786..4065022))
Gene LOC18591923
GeneID 18591923
Organism Theobroma cacao

Protein

Length 523aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007018355.2
Definition PREDICTED: squalene monooxygenase [Theobroma cacao]
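
The Location and Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: it assumes the coordinates are 1-based and inclusive (the usual convention for this feature-location syntax), and the names `parse_segments`, `cds_length`, and `on_minus_strand` are illustrative only.

```python
import re

# Location string copied verbatim from the CDS record above.
LOCATION = (
    "complement(join(4061360..4061464,4061602..4061880,4061988..4062263,"
    "4062417..4062635,4062776..4062944,4063153..4063327,4063454..4063565,"
    "4064786..4065022))"
)


def parse_segments(location):
    """Return (start, end) pairs in the order they are listed.

    Assumes 1-based, inclusive coordinates.
    """
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


segments = parse_segments(LOCATION)

# Each inclusive interval start..end spans end - start + 1 bases.
segment_lengths = [end - start + 1 for start, end in segments]
cds_length = sum(segment_lengths)

# A complete CDS should be a whole number of codons.
codons, remainder = divmod(cds_length, 3)

# "complement(...)" indicates the feature lies on the reverse strand.
on_minus_strand = LOCATION.startswith("complement(")

print(len(segments), "segments, lengths:", segment_lengths)
print("CDS length:", cds_length, "nt =", codons, "codons, remainder", remainder)
print("reverse strand:", on_minus_strand)
```

Under these assumptions the eight segments sum to 1572 nt, which is 524 codons: 523 amino acids, matching the Length field, plus one stop codon.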

EGGNOG-MAPPER Annotation

COG_category I
Description squalene
KEGG_TC -
KEGG_Module -
KEGG_Reaction R02874
KEGG_rclass RC00201
BRITE ko00000
ko00001
ko01000
KEGG_ko ko:K00511
EC 1.14.14.17
KEGG_Pathway ko00100
ko00909
ko01100
ko01110
ko01130
map00100
map00909
map01100
map01110
map01130
GOs GO:0001101
GO:0005575
GO:0005622
GO:0005623
GO:0005737
GO:0005783
GO:0006629
GO:0006694
GO:0006950
GO:0008150
GO:0008152
GO:0008202
GO:0008610
GO:0009058
GO:0009414
GO:0009415
GO:0009628
GO:0010035
GO:0012505
GO:0016125
GO:0016126
GO:0042221
GO:0043226
GO:0043227
GO:0043229
GO:0043231
GO:0044238
GO:0044424
GO:0044444
GO:0044464
GO:0050896
GO:0071704
GO:1901360
GO:1901362
GO:1901576
GO:1901615
GO:1901617
GO:1901700

Sequence

CDS:  
ATGGCGGATTCGTACGTGTGGGGATGGATCTTGGGCTCCGTGATGACGCTGGTTGCGTTGTGCGGTGTCGTTTTAAAGAGACGGAAAGGGAGCGGAATCTCCGCCACGAGGACTGAGTCCGTGAAGTGCGTCTCCTCGATCAACGGAAAATGCAGATCCGCTGACGGTAGCGACGCTGATGTCATCATCGTTGGAGCTGGCGTTGCTGGCTCCGCCCTCGCTCACACTCTCGGCAAGGATGGACGTCGAGTGCATGTGATTGAAAGAGACTTGACAGAGCCTGACCGTATTGTTGGAGAATTGCTACAACCAGGGGGCTATCTAAAGTTAATTGAGTTGGGACTTGAAGATTGTGTGGAGGAAATTGATGCTCAGCAGGTATTTGGTTATGCTCTTTTCAAAGATGGGAAGCATACCCGACTTTCTTATCCCTTAGAAAAGTTCCACTCAGATGTATCTGGTAGGAGCTTTCATAATGGACGTTTCATACAGAGGATGAGGGAGAAATCAGCTTCTCTTCCCAATGTACGTTTGGAGCAAGGGACAGTTACTTCTCTACTTGAAGAAAAGGGAACAATTAGAGGAGTCCAGTACAAAACTAAAGATGGCAGGGAATTGACAGCATTTGCACCCCTGACCATTGTCTGTGACGGTTGTTTTTCAAACTTGCGTCGCTCCCTTTGCAACCCTAAGGTAGATGTACCCTCTTGTTTTGTGGGATTGGTCCTGGAGAATTGCAATCTTCCATACTCAAATCATGGGCATGTTATACTAGCAGATCCTTCCCCCATTTTGTTCTATCCTATCAGCAGCACAGAGGTTCGCTGTCTGGTTGATGTACCTGGTCAGAAGGTTCCTTCTATTGCAAATGGGGAGATGGCAAATTATCTGAAGACCATTGTGGCTCCTCAGGTTCCTCCAGAAATCTACAATTCATTTGTAGCAGCTGTTGATAAGGGAAATATTAGGACAATGCCAAACAGGAGCATGCCAGCTGCTCCTTATCCCACTCCTGGAGCCCTATTAATGGGAGATGCATTCAACATGCGCCATCCATTAACTGGTGGAGGAATGACTGTGGCATTATCAGATATTGTTGTCCTCCGCGATCTACTAAGGCCTCTGCGTGACCTCAATGATGCACCTACCCTCTGCAAATATCTTGAATCATTTTACACCTTGCGTAAGCCTATAGCATCTACTATCAATACCTTGGCAGGTGCCTTGTATAAGGTGTTCTGTGCTTCACCTGATCAAGCAAGGAAAGAAATGCGTCAGGCTTGCTTCGATTATCTAAGCCTTGGTGGTGTATTCTCAACAGGACCTATCTCTTTGCTCTCTGGTTTGAACCCTCGCCCTGTGAGCTTGGTTCTGCATTTCTTTGCTGTGGCAATATATGGTGTTGGTCGTTTACTATTGCCGTTCCCTTCACCTAAGCGAATCTGGATTGGAGCTAGGCTGATCTCGGGAGCGTCAGGAATCATCTTCCCAATTATCAAGGCAGAAGGAGTTAGGCAAATGTTTTTCCCTGCAACTGTTCCTGCATATTACAGAGCTCCTCCTGTTGAGTGA
Protein:  
MADSYVWGWILGSVMTLVALCGVVLKRRKGSGISATRTESVKCVSSINGKCRSADGSDADVIIVGAGVAGSALAHTLGKDGRRVHVIERDLTEPDRIVGELLQPGGYLKLIELGLEDCVEEIDAQQVFGYALFKDGKHTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKSASLPNVRLEQGTVTSLLEEKGTIRGVQYKTKDGRELTAFAPLTIVCDGCFSNLRRSLCNPKVDVPSCFVGLVLENCNLPYSNHGHVILADPSPILFYPISSTEVRCLVDVPGQKVPSIANGEMANYLKTIVAPQVPPEIYNSFVAAVDKGNIRTMPNRSMPAAPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLRPLRDLNDAPTLCKYLESFYTLRKPIASTINTLAGALYKVFCASPDQARKEMRQACFDYLSLGGVFSTGPISLLSGLNPRPVSLVLHFFAVAIYGVGRLLLPFPSPKRIWIGARLISGASGIIFPIIKAEGVRQMFFPATVPAYYRAPPVE